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ABSTRACT 

Both kinetochore function and sister chromatid 
cohesion can depend upon pericentromere chroma- 
tin structure, and factors associated with hetero- 
chromatin have been proposed to have general, 
conserved roles in distinguishing centromeres and 
pericentromeres and in conferring pericentromere- 
intrinsic functions. We applied genome-wide 
sequencing approaches to quantify RNA expres- 
sion, DNA methylation and histone modification 
distributions in maize (Zea mays), focusing on two 
maize chromosomes with nearly fully sequenced 
centromeres and pericentromeres. Aside from the 
presence of the Histone H3 variant common to all 
centromeres, Centromeric Histone H3 (CENH3), 
we found no RNA expression or chromatin modifi- 
cations that clearly differentiate pericentromeres 
from either centromeres or from chromosome 
arms, nor did we identify an epigenetic signature 
that accurately predicts CENH3 location. RNA 
expression and chromatin modification frequencies 
were broadly associated with distance from centro- 
meres, gradually peaking or dipping toward arms. 
When interpreted in the context of experimental 
data from other systems, our results suggest that 
centromeres may confer essential functions (such 
as cohesion retention) to flanking sequence regard- 
less of the local heterochromatin profile. 

INTRODUCTION 

The sequence of steps in chromosome movement during 
cell division depends on a unique chromatin environment 
in centromeres. Many factors can contribute to the critical 



attributes of centromere chromatin, but the histone 
variant CENH3 (also referred to as CENP-A) is viewed 
as a defining feature of an active centromere [reviewed 
in (1,2)]. In organisms where centromeres exist within 
a single domain on each chromosome, the adjacent 
domains are generally referred to as pericentromeres. 
Pericentromeres also have essential roles in chromosome 
movement during cell division as they provide the frame- 
work for keeping sister chromatids oriented correctly until 
anaphase. Between prophase and anaphase, sister chroma- 
tids remain attached along their pericentromeres specific- 
ally, as cohesin is removed everywhere else [reviewed in 
(3)]. Various factors associated with heterochromatin 
during interphase have been proposed to set the stage 
for the retention of cohesin at pericentromeres, including 
Heterochromatin Protein 1 (HPl), trimethylation of lysine 
9 on histone H3 (H3K9me3) and RNAi (4,5). Transient 
deposition of histone modifications during early mitosis 
has also been proposed to function in this regard, such 
as phosphorylation of serines 10 and 28 on histone H3 
and phosphorylation of serine 121 on histone H2A (6,7). 

Extensive investigation in the fission yeast Schizo- 
saccharomyces pombe pericentromere chromatin has led 
to the discovery that specific chromatin factors and an 
RNAi-related mechanism are key not only for proper 
cohesin dynamics but also for proper placement of 
CENH3 (8,9). The DNA composition itself contributes 
to pericentromere function, as repetitive sequences in the 
pericentromere are transcribed and give rise to siRNAs 
that recruit heterochromatin factors (5,10,11). Despite 
the comparatively simple composition and small size of 
S. pombe centromeres and pericentromeres, similarities 
have been identified in the much more complex centromer- 
ic regions of diverse plants and animals. In addition to 
heterochromatin factors in pericentromeres, histone modi- 
fications associated with euchromatin have been observed 
in centromeres of both S. pombe and animals (12,13). 
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Schizosaccharomyces pombe is different from many other 
eukaryotes in that it lacks DNA methylation, which is an 
important component of heterochromatin in organisms 
which are capable of it. Hence extrapolation from 
S. pombe suggests that DNA methylation could contribute 
to pericentromere heterochromatin function (though not 
actually in S. pombe itself). 

The similarities between S. pombe and other organisms 
have led to much conjecture about a conserved, functional 
connection between pericentromere heterochromatin and 
both centromere function and sister chromatin cohesin 
(14,15). However, the extent to which the S. pombe 
model appHes to other organisms, with its rehance on 
control elements intrinsic to pericentromeres, is unclear 
[see discussions in (6,16,17) and references therein]. 
These arguments against broad appHcabihty include: 
(i) a general lack of mitotic centromere or cohesin 
defects in RNAi and chromatin modification mutants in 
plants and animals; (ii) Profound differences in both 
DNA structure and siRNA expression patterns between 
S. pombe and other organisms; (iii) Dispersed rather than 
pericentromere-concentrated heterochromatin marks in 
plants with large genomes; (iv) Existence of functional 
centromeres that lack canonical pericentromere sequences 
or even heterochromatin domains (18,19); and (v) Lack of 
a clear Hnk between HPl and centromere function in 
animals (20-22) and an established euchromatic function 
for the HPl homolog in plants (23-25). These observa- 
tions support the notion that the predominant controUing 
factors are derived from within the centromere itself, 
both for internal control (CENH3-containing chromatin) 
and external (pericentromere cohesion) (17). 

Pericentromere-intrinsic control models derived from 
S. pombe predict that the pericentromere chromatin 
should be functionally and visibly distinct both from the 
centromere chromatin and the arm chromatin. The 
presence of CENH3 within the centromere specifically is 
one such distinction; however, centromeres also contain 
measurable amounts of cannonical H3 and of course 
H2A, H2B and H4 [reviewed in (1,2)]. Within the 
scope of histone modifications, DNA methylation, and 
RNA expression patterns, one would expect to see a 
pericentromere chromatin signature that could be con- 
nected to its functional role. Such a signature is clearly 
evident in S. pombe and others (12). Maize presents a 
useful organism for direct testing of this expectation 
because of its large, sequenced centromeres and high- 
resolution maps of CENH3 occupancy (26,27) and 
because of its rich history of genetic and cytological 
analysis of chromosome structure. Previous cytological 
analyses in maize have revealed that its heterochromatin 
is dispersed along the length of its chromosomes rather 
than being highly concentrated just at pericentromeres, 
and the euchromatin histone modification H3K4me2 
that is enriched in S. pombe and animal centromeres 
shows no enrichment in maize centromeres (28-30). 

In order to more fully characterize RNA expression 
and chromatin patterns in maize centromere regions 
and to search for evidence of a pericentromere chromatin 
signature, we examined siRNA and poly(A) RNA 
expression, DNA methylation, and four euchromatic 



histone modifications patterns — H3K4me3, H3K36me3, 
H3K27me3 and acetylation of H3K9 — across chromo- 
somes in maize using both our own and publicly available 
deep sequence data. We found no evidence for anything 
pecuHar to pericentromeres, but instead found that the 
centromere itself was the predominant feature in terms 
of large-scale patterns. The general trend manifest from 
these analyses was the centromere rather than pericen- 
tromere providing the most visible mark along the 
chromosome, representing either the top or bottom of a 
large curve with opposite extremes toward the arms. 

MATERIALS AND METHODS 

Derivation of small RNA sequencing libraries 

Wild-type maize seeds, B73 stock, were germinated in 
vermiculite at 30°C for 3 days and the last 2-3 mm of 
the primary root tips cut off with a razor blade and 
frozen on dry ice. After grinding to a fine powder in 
liquid nitrogen, small RNA was extracted using the 
mirVana™ miRNA Isolation Kit (Ambion). A Solexa/ 
Illumina Hbrary was then prepared using oligos as 
described in (31) and a Hgation scheme derived from 
(32) and (33), which selectively captures Dicer products 
and other small RNAs with both a 3^-OH and a 
5^-monophosphate. In order to distinguish this sample 
from others also included in the same sequencing run, 
we included a barcode CGT on the 5^-adapter. Samples 
were sequenced with the Illumina Genome Analyzer II. 
The 5^-barcode and 3^-adapter sequence were trimmed 
off as described previously except that the 3^-adapter 
was identified and trimmed based on a perfect match to 
the first 8 nt of the adapter (CTGTAGGC) (31). The raw, 
untrimmed reads are available at the NCBI Sequence 
Read Archive, accession SRR218319.1. 

Derivation of CENH3-chromatin immunoprecipitation 
bisulfite sequencing libraries 

Chromatin immunoprecipitation (ChIP) was carried out 
as previously described (34) using anti-CENH3 antibodies 
(35). The 14-day-old B73 seedlings were used as the source 
tissue. The ChlPed DNA was treated with sodium bisulfite 
and prepared for Illumina sequencing using a method 
similar to that of Lister et al. (36). Reads are available 
at the NCBI Sequence Read Archive, accession 
SRR218318.1. 

Read processing and alignments 

After removal of barcodes and adapters from the small 
RNA reads, known microRNAs were filtered out by 
Blasting against a set of maize mature miRNA sequences 
obtained from miRBase (release 16) [www.mirbase.org; 
(37)]. We used blastall with default parameters except 
the expectation value {E) was set to 1 e-6. The resulting 
miRNA-filtered reads, which we refer to as siRNA reads, 
were sorted by read length to produce individual fasta files 
corresponding to each length from 20 to 26 nt. Each file 
was then aligned to the complete maize genome version 2, 
obtained from www.maizesequence.org. AHgnments were 
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carried out with bowtie software (38). For analysis of 
uniquely-aligning reads, i.e. those with a single best 
alignment, the following parameters were used: '-n 0 -1 
20 -m 1 -best'. For analysis of total aligning reads, the 
parameters were: '-n 0 -1 20 -M 1 -best'. Alignment of the 
454-sequenced CENH3 reads (27) to the reference genome 
to mark the centromeres in figures was carried out 
with bowtie similarly to siRNA reads except that only 
uniquely-aligning reads were analyzed and a longer seed 
length was used '-n 0 -1 100 -m 1 -best'. 

Control reads were derived from a sample of randomly- 
fragmented B73 genomic DNA sequenced with Illlumina 
paired-end technology (39). We combined the reads from 
both ends into a single set, which we treated as single-end, 
trimmed to the appropriate length (20, 26, or 36 nt), 
and processed identically to the siRNA reads (except 
without removal of miRNA matches). This control was 
intended primarily as an indicator for domain-level 
effects from errors in the physical map, particularly 
omissions, as a large proportion of centromere repeats 
have not yet been placed on the physical map. However, 
any biases inherent to Illumina sequencing may also be 
reflected in this sample. 

Bisulfite reads were aUgned to the genome and to 
individual repeat sequences using BS Seeker (40). Only 
reads that could be unambiguously assigned to a single 
locus with no mismatches were used for quantifying 
methylation status, using default parameters except '-t N 
-e 36 -m 0'. Control, non-bisulfite-treated reads from 
Tenaillon et al. (39) were processed exactly the same. 
The low error reads were derived by extracting only the 
first 36 nt; the high error reads were from nucleotides 49 to 
84 of the reads. 

Poly (A) enriched, ChIP reads (35 nt) and the control 
reads (low error reads; see above) were processed similarly 
to the siRNA reads, but without removal of miRNA 
matches. 



RESULTS 

siRNA expression patterns 

Given the importance of siRNAs in directing histone 
modifications in S. pombe pericentromeres and in direct- 
ing DNA methylation of repetitive elements in plants, we 
first surveyed siRNA expression patterns genome-wide 
for evidence of a pericentromeric chromatin signature. 
We prepared Illumina small RNA sequencing libraries 
from RNA extracted from maize root tips (which 
contain a large fraction of dividing cells) and aUgned the 
resulting siRNA reads onto the maize genome. A compli- 
cation with aUgning short reads to a highly repetitive 
genome is that in many cases, it is impossible to determine 
which of the multiple identical loci is the source of a read. 
Two common solutions are to disregard all but unique 
reads (those that can unambiguously be assigned to a 
single locus), or to include all reads and to assign a 
locus at random from the possible options. One method 
loses information, especially when deaUng with highly 
repetitive centromeres; the other entails a higher level of 
misinformation, since it will assign a portion of repetitive 



reads to the wrong loci. We opted to use both methods, as 
displayed in Figure 1 and Supplementary Figure SI. Since 
one of the distinguishing characteristics of functionally 
distinct types of siRNAs is length, we spHt the siRNA 
reads into sets based on length, from 20 to 26 nt. The 
most abundant, and potentially most interesting because 
of their connection to heterochromatin, are the 22-nt and 
24-nt types (41,42) 

Centromeres were defined to include all the chromatin 
within the outer bounds of the region of the chromosomes 
enriched for CENH3 [defined by ChIP with CENH3 
antibodies (27)]. In all but two chromosomes (chromo- 
somes five and seven) a single, MB-scale domain of 
CENH3 was designated as the centromere in a prior 
study (27). These regions are also referred to as centro- 
mere cores, designating that they are defined precisely by 
CENH3 occupancy. Although centromeric sequences are 
highly repetitive, there is still enough variation in the 
DNA content to ahgn even short reads uniquely, especial- 
ly in centromeres two and five, which have been traversed 
and sequenced (27). For the 22-nt siRNAs, 11% of the 
putative centromere matching reads were unique. Small 
RNAs have also been successfully mapped to rice centro- 
mere 3 (43). We note that in some rice centromere cores, 
CENH3 domains are interspersed with extended regions 
of H3-containing chromatin (44). This is probably true in 
maize as well, but at this stage we lack the resolution to 
determine where such regions may lie. 

The 22-nt siRNAs were observed at a consistently 
low basehne level across chromosome domains, with 
large spikes in specific regions (these are not known 
microRNAs). In contrast, the chromosomal distribution 
of 24-nt siRNAs revealed a pattern of higher density 
toward the arms and a low point within the centromere, 
reminiscent of both recombination frequency and gene 
density (26,45,46). On chromosome 5 there was a particu- 
larly low density of 24-nt siRNAs over the region 
associated with CENH3, however a similar dip was not 
observed on chromosome 2. We also examined the 
chromosomal distribution of other lower abundance 
siRNAs between 20 and 26 nt on chromosome 2; 
however, none showed anything unusual in the 
pericentromeric region (Supplementary Figure SI A and 
B). These high-resolution analyses of siRNAs within 
centromere core regions and surrounding areas provide 
additional evidence that centromeric domains are compat- 
ible with both transcription and siRNA production, but 
only at a low level in both centromere and pericentromere 
relative to chromosome arms in root tip cells. 

DNA methylation patterns 

The lack of a pericentromeric 24-nt siRNA signature sug- 
gested that associated DNA methylation might also lack a 
distinguishing pericentromeric pattern. DNA methylation 
in plants exists in three forms: cytosine methylation in CG, 
CHG and CHH contexts, where H is anything but G. 
The CHH type is most heavily dependent upon 24-nt 
siRNAs [reviewed in (47)]. Our intent was to focus on 
the centromere core regions, which had been shown in 
some cases to display hypomethylation in prior 
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Figure 1. Comparison of siRNA abundance across chromosome domains. (A and B) Chromosome-wide views and centromere close-ups of 24- and 
22-nt siRNAs. (A) Whole chromosome views of chromosomes 2 and 5, and (B) close up views of the centromeres plus 5 Mb of sequence on either 
side. For small RNAs, the Y-axes show read counts, normalized per million total genomic reads for each length at each 500-kb or 50-kb interval. 
Unique siRNAs are those corresponding to a single locus whereas total siRNAs include those which could be derived from multiple loci (one of 
which was chosen at random). Note that some peaks extend off the chart. The CENH3 reads mark unique loci within centromeres and are derived 
from CENH3 ChIP [described in (27)]. CENH3 read count was normalized per hundred thousand unique genomic reads. See also Supplementary 
Figure SI. (C) Average siRNA count for each chromosome domain. Centromeres are the domains occupied by CENH3, as defined by the CENH3 
Chip reads. For the purposes of this study, pericentromeres extend 5 MB on either side of the centromeres, and arms are the rest of the chromo- 
somes beyond the pericentromeres. Error bars are standard deviations arising from differences in the values from the 10 chromosomes. The control 
reads are derived from an Illumina-sequenced, randomly-fragmented genomic DNA sample (39) and are included to provide an estimate of the bias 
introduced in each domain due to inaccuracies in the physical map and biases arising from Illumina sequencing. 
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immunostaining studies using methyl-cytosine-specific 
antibodies (48-51). To this end, we first performed ChIP 
using maize CENH3 antibodies to enrich for centromeric 
sequences. We then treated the sample with bisulfite and 
sequenced it with Illumina technology. The experiment 
conducted in this way clearly delineated the boundaries 
of the centromere cores while retaining sufficient read 
depth to cover the rest of the maize genome (Figure 2 
and Supplementary Figure S2A and B). The analysis 
revealed gentle, chromosome-level trends of CG and 
CHG methylation increasing toward centromeres and 
decreasing toward arms, as has been observed in many 
other species (52,53). In contrast, CHH methylation was 
generally similar across different domains of the chromo- 
somes as has been observed previously in rice (52). 
Although overall methylation was similar in centromeres 
and pericentromeres, within the centromere core we noted 
a subtle increase in CG methylation and decrease in CHG 
methylation, with notable variation among centromeres 
(Figures 2, Supplementary Figure S2A and B). Such vari- 
ation may be attributable to varying sequence coverage 
and/or the relative sizes of the CentC arrays in different 
centromeres, which range from <300kb to several 
megabases (27,54). 

This genome-wide analysis was initially limited to 
unique loci in order to ensure high accuracy in quantifi- 
cation of methylation frequency (Supplementary Figure 
S2C). To assess the level of methylation within repetitive 
areas, which are under-presented in the genome assembly, 
we analyzed reads that match the tandem repeat (satellite) 
CentC and the retrotransposon CRM2 independently of 
map position and copy number (Figure 2D). These two 
elements are the most abundant of the repetitive elements 
in maize centromeres (26,27). We found that CHH methy- 
lation was present at very low levels for CentC and CRM2, 
similar to CHH methylation at unique centromere loci 
(~1% of cytosines in a CHH context were methylated 
in each case) The average CHG methylation values for 
both CentC and CRM2 as a whole were lower than the 
CHG values in unique centromeric loci (42% for CentC 
and 45% for CRM2 as compared with 65% for unique 
centromeric loci). These results may partially explain the 
prior observation that active maize centromeres stain 
poorly with methyl-cytosine-specific antibodies in maize 
and other plants (44,46). In contrast, CG methylation 
in centromere repeats was not notably different from 
either single copy pericentromeric or single copy centro- 
meric regions. The data reveal that 88% of the CG 
dinucleotides within the CentC repeats are methylated 
on average (Figure 2D); however it is likely that the 
levels of CG methylation vary within long arrays (44-47). 

mRNA and euchromatin histone modification patterns 

Transcription has been observed at both centromeres and 
pericentromeres in many other organisms, as evidenced 
by sequence identity with known centromere- or 
pericentromere- specific repeats. In S. pombe, transcription 
driven by RNA polymerase II occurs at pericentromeres 
but not centromeres (11,12). Similarly, in rice, genes exist 
in centromere core regions but they tend to He within 



subdomains of normal H3 chromatin and not the 
subdomains identified by CENH3 (44). Nevertheless, a 
direct comparison is difficult because repetitive sequences 
in plants can be transcribed via a compHcated interaction 
of three RNA polymerases, II, IV and V, of which only II 
exists in S. pombe [reviewed in (55)]. 

Maize has the distinction of being the first organism for 
which centromere RNA was found to associate directly 
with CENH3 chromatin (56) but the most detailed 
analyses have been done in mammals, where transcription 
has been associated with both chromatin modifications 
and kinetochore integrity [reviewed in (57)]. In order to 
quantify RNA expression over the entire centromere and 
pericentromere in maize, we examined a previously 
pubHshed set of poly(A)-enriched RNA fragments (58). 
We took the raw, unfiltered data from this study, and 
produced genome-wide alignments to measure steady- 
state RNA levels (Figure 3). The clear image that 
emerges from this is a general depletion of poly(A) 
RNA toward the centromere and enrichment in chromo- 
some arms, as expected based on density of protein-coding 
genes. However, low but detectable expression was also 
present in centromeres. 

To measure transcript abundance for the tandem repeat 
CentC and the retrotransposon CRM2, we counted the 
reads that matched to either element. Of the 26 071222 
genome-matching, poly(A)-enriched reads, 4 matched 
CentC and 318 matched CRM 2, and only a handful of 
these mapped uniquely. An analysis of the non-unique 
CRM2 aUgnments on centromere 2 suggested that most 
of the transcripts were derived from the centromere core 
areas (Supplementary Figure S3), as expected based on the 
distribution of CRM 2 elements (27). We also observed a 
substantial number of siRNAs derived from CentC and 
CRM 2: In a sample of 1 550 399 total genome-matching 
siRNA reads, 127 matched CentC and 545 matched 
CRM 2 (Supplementary Figure SIC and D). These 
siRNAs are likely derived from non-polyadenylated or 
otherwise unstable templates. CentC transcripts of 
unusual sizes have also been detected by RNA blotting 
and in sequence Hbraries from non-poly(A)-enriched 
RNA (56,59,60). Based on the centromeric location of 
CentC DNA (27), and the physical association of CentC 
transcripts with CENH3 chromatin (56), these data 
provide compelling evidence for transcription from 
centromeres in maize, though not in a form that is 
readily captured and sequenced by poly (A) enrichment. 
In contrast, evidence for S. pombe-typc transcription 
from pericentromeres is lacking in maize. 

Euchromatin histone modifications are generally 
thought to be depleted from pericentromeres, and in 
both S. pombe and multiple animal species, certain 
euchromatin modifications (H3K4me2 and H3K36me3) 
have been reported to be enriched with the centromere 
itself relative to pericentromeres (12,13,61,62). Such an 
enrichment is not observed by immunostaining in maize 
(28,29), nor in the filamentous fungus Neurospora crassa, 
where H3K4me2 is absent in centromeres but enriched in 
flanking areas (19). The Wang et al., study from which we 
derived our analysis of poly(A) RNA fragments also 
included a large-scale ChiP with antibodies for four 
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euchromatin histone modifications. We used the reads 
from this study to ask whether an enrichment could be 
seen in centromeres, or whether any clear difference 
between centromeres and pericentromeres could be 
detected (Figure 3 and Supplementary Figure S3). While 
some reduction in H3 modifications is expected in 
centromeres by virtue of CENH3 occupancy, all four of 
the modifications examined — H3K4me3, H3K36me3, 
H3K27me3 and H3K9ac — were generally depleted 
within both centromeres and pericentromeres, similar to 
24-nt siRNAs, poly(A) RNA, density of protein coding 
genes, and crossover suppression, and opposite to that 
of CG and CHG methylation. The chromosome landscape 
visible from these views is consistently the centromere at 
the summit of the hill or bottom of the valley, and the 
pericentromere occupying the slopes on either side. The 
landscapes are notably gentle, though highly variable, and 
if not for the presence of CENH3, we would find it diffi- 
cult to pinpoint the locations of centromeres by standard 
epigenetic profiling. 



DISCUSSION 

Using genome-wide epigenetic profiHng, we found that 
maize pericentromeres differed little from centromeres 
(with the key exception that CENH3 replaces canonical 
of H 3 in the kinetochore domain). There are many caveats 
to this study relating to the repetitive nature of centro- 
meres and the fact that only two maize centromeres are 
fully assembled, and none are fully sequenced base for 
base. The results apply most firmly to centromeres two 
and five where nearly complete sequences are available. 
In the most conservative sense, our data can be viewed 
as pertaining to regions that are unique by a 36-nt or 
less sequence tag. However, despite the short size of the 
reads, a large proportion can be aUgned to regions within 
the centromere core and flanking pericentromeric areas 
(e.g.. Figure 2E and IC). 

If arms are viewed as one extreme and centromeres as 
the other, pericentromeres represent an intermediate state 
but are much more similar to centromeres than to arms. 
This general trend was true for siRNAs, DNA methyla- 
tion, poly(A)-enriched RNA, and for four histone 
modifications — H3K4me3, H3K36me3, H3K27me3 and 
H3K9ac. These results are similar to what has been 
described in Caenorhabditis elegans, Neurospora crassa, 
Candida albicans, Saccharomyces cerevisiae and many 
other species in that they lack defined pericentromeric 



heterochromatin regions. It is of course possible that 
other marks or specific combinations of marks delineate 
the pericentromere in these species, but such an unidenti- 
fied code would not generally be classified as hetero- 
chromatin as we understand it. 

Our data showing that maize heterochromatin gradual- 
ly decreased in relation to the distance from the centro- 
mere differs substantially from what has been shown in 
several widely studied species that have an enrichment of 
heterochromatin directly flanking centromeres, including 
S. pombe, Drosophila melanogaster, A. thaliana, and 
humans. However, a close look at these systems suggest 
that they are dissimilar in fundamental ways [reviewed in 
(16,17)]. For example, HPl performs a critical function of 
ensuring cohesion at S. pombe pericentromeres but 
appears to be dispensable for cohesion in humans 
(20-22) and in A. thaliana (23-25). Furthermore, centro- 
meres are orders of magnitude smaller in S. pombe than in 
these other species. These data suggest that while hetero- 
chromatin may frequently be found in close proximity to 
centromeres, its role in chromosome segregation varies 
among species. S. pombe may provide an extreme 
example with its dependence on heterochromatin for de 
novo deposition of CENH3 and for proper cohesion 
dynamics (4,5,8,9) 

In contrast to pericentromeric heterochromatin, 
CENH3 is always found at active centromeres, making 
it a prime candidate as regulator of its own deposition, 
although perhaps indirectly [reviewed in (1,2)]. The most 
compelling arguments in favor of the view that centro- 
meres and pericentromeres can function independently 
comes from cases where centromeres are known to have 
moved to new locations. In the yeast S. cerevisiae, centro- 
meres that have been empirically moved to chromosome 
arms are sufficient for apparently normal cohesion 
behavior flanking the new centromere (63). Furthermore, 
it is well estabhshed that human neocentromeres can form 
on chromosome arms and in the absence of both canonical 
pericentromeric DNA sequence and of visible flanking 
heterochromatin. Conversely, when a centromere has 
been epigenetically deactivated on a dicentric maize 
chromosome, pericentromeric phosphorylation of H3 
serine 10 and cohesion are lost, demonstrating a reliance 
on the centromere (64). We emphasize, however, that 
CENH3 must interact with H3 at a local level since they 
are in close proximity and H3 is present in centromere 
cores. For example, multiple experiments with human 
artificial chromosomes have revealed interactions 



Figure 2. Continued 

below) could be biased toward the better sequenced centromeres. Several chromosomes have errors in their physical maps leading to substantial 
CENH3 signals on arms (black Hne). (C) Average DNA methylation in unique loci in each chromosome domain. Included are both percent of 
methylated cytosines per total nucleotides, as in (A), and percent of methylated cytosines per total cytosines in each context. Centromeres are the 
domains occupied by CENH3, as defined by the CENH3 ChIP reads. For the purposes of this study, pericentromeres extend five MB on either side 
of the centromeres, and arms are the rest of the chromosomes beyond the pericentromeres. Error bars are standard deviations arising from 
differences in the values from the 10 chromosomes. (D) DNA methylation in repetitive loci. CentC is the tandem repeat of maize centromeres, 
CRM2 is the dominant centromere retrotransposon, and Knob is a tandem repeat found on arms. Shown are both percent of methylated cytosines 
per total nucleotides, as in (A), and percent of methylated cytosines per total cytosines in each context. (E) Abundance of bisulfite reads. Raw counts 
are depicted for each chromosome domain (uniquely aligning reads in gray) and for repeats (blue). Due to enrichment for CENH3 domains by ChIP 
prior to bisulfite treatment and sequencing, a substantial number of uniquely aligning reads come from centromeres, despite the high density of 
repeats in centromere repeats. 
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Figure 3. Poly(A) RNA and euchromatin histone modification patterns. (A) Chromosome-wide view of reads from poly(A) RNA fragments and 
from Chip with antibodies against H3K4me3, H3K36me3, H3K27me3 and H3K9ac [derived from (58)]. Also included are a set of control, genomic 
DNA fragments (39). All counts are normalized per milhon reads. Note that some peaks extend off the chart. The CENH3 reads mark unique loci 
within centromeres and are derived from CENH3 ChIP [described in (27)]. See also Supplementary Figure S3. (B) Average read count for each 
chromosome domain. Centromeres are the domains occupied by CENH3, as defined by the CENH3 ChIP reads. For the purposes of this study, 
pericentromeres extend 5 MB on either side of the centromeres, and arms are the rest of the chromosomes beyond the pericentromeres. In order to 
lessen biases from errors in the physical map and Illumina sequencing, all values were normalized by the same genomic DNA fragment control reads 
as displayed in Figure IC. In this case, the control reads were trimmed to 36 nt to match the read length of the poly(A)-enriched RNA and histone 
modification reads. For the unique ahgnments, this normalization also removed the bias against repetitive regions, since the control reads were 
subject to the same unique ahgnment criteria. Error bars are standard deviations arising from differences in the values from the 10 chromosomes. 
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Figure 4. Models for centromere/pericentromere interactions. The arrows in each diagram depict proposed directions of causahty, either arising from 
centromere (center circle) or pericentromere (flanking rectangles). 



between CENH3 and the local chromatin environment 
(61,65-67). 

Our findings reveal that the centromere is the single 
most prominent feature of the entire centromere- 
pericentromeric region in maize, and suggest that 
CENH3 alone may be responsible for functions often 
ascribed to pericentromeres (summarized in Figure 4). 
Cohesion may be controlled indirectly by kinetochores, 
and the vast domain of recombination suppression 
flanking centromeres may also be an indirect outcome of 
the total recombination block within centromere cores 
(68) and other mechanical constraints on recombination 
events that He in close proximity to centromeres at meiosis 
[reviewed in (17)]. We emphasize, however, that our data 
do not demonstrate a mechanistic connection between the 
centromere and whole-chromosome patterns of RNA 
expression and chromatin marks. Furthermore, our data 
leave open the possibiHty that specific epigenetic marks 
such as phosphorylation or other transient modifications 
can be associated with pericentromeric areas during 
mitosis (6,7). What seems clear is that pericentromere 
structures vary dramatically across species but peri- 
centromere function is conserved. Our data, combined 
with multiple studies pointing to similar conclusions, 
suggest that new models are needed to explain how and 
why kinetochores are invariably flanked by reduced re- 
combination and regulated retention of cohesion. 
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